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PACS 87. 16. Yc - Regulatory genetic and chemical networks 

PACS 89 . 90 . +n - Other topics in areas of applied and interdisciplinary physics 

Abstract - We analyze the gene expression data of Zebrafish under the combined framework of 
complex networks and random matrix theory. The nearest neighbor spacing distribution of the 
corresponding matrix spectra follows random matrix predictions of Gaussian orthogonal statistics. 
Based on the eigenvector analysis we can divide the spectra into two parts, first part for which 
the eigenvector localization properties match with the random matrix theory predictions, and the 
second part for which they show deviation from the theory and hence are useful to understand 
the system dependent properties. Spectra with the localized eigenvectors can be characterized 
into three groups based on the eigenvalues. We explore the position of localized nodes from these 
different categories. Using an overlap measure, we find that the top contributing nodes in the 
different groups carry distinguished structural features. Furthermore, the top contributing nodes 
of the different localized eigenvectors corresponding to the lower eigenvalue regime form different 
densely connected structure well separated from each other. Preliminary biological interpretation 
of the genes, associated with the top contributing nodes in the localized eigenvectors, suggests 
that the genes corresponding to same vector share common features. 



Introduction. — Gene expression information cap- 
tured in microarrays data for a variety of environmental 
and genetic perturbations promises to yield unprecedented 
insights into the organization and functioning of biologi- 
cal systems [IH1] . The challenge no longer lies in obtaining 
gene expression profile, but rather in interpreting the re- 
sults to gain insight into biological mechanisms. It has 
been increasingly realized that dissecting the genetic and 
chemical circuitry prevents us from further understand- 
ing the biological processes as a whole. In order to un- 
derstand the complexities involved, all reactions and pro- 
cesses should be analyzed together. To this end, network 
theory has been getting fast recognition to study systems 
which could be defined in terms of units and interactions 
among them [5H7]. In this view one approach is to study 
the co-expression of genes, and to build up gene-sets work- 
ing together. A gene co-expression network is defined by 
a set of nodes corresponding to genes, and a list of edges 



corresponding to co-expression. Using gene co-expression 
to recover co-regulated genetic modules is a standard ap- 
proach adapted in system biology [9]. We utilize gene 
expression data from Zebrafish exposed to various toxi- 
cants as study model [8] . The Zebrafish is an increasingly 
popular model not only for vertebrate development [10] 
but also for understanding human diseases [TT] and tox- 
icology [12] . We analyze the gene co-expression network 
constructed from Zebrafish data under the random matrix 
theory (RMT) framework. 

RMT was initially proposed to explain the statistical 
properties of nuclear spectra [13) . Later this theory was 
successfully applied in the study of the spectra of differ- 
ent complex systems such as disordered systems, quantum 
chaotic systems, and large complex atoms [H]. Further 
studies illustrate the usefulness of RMT in understanding 
the statistical properties of the empirical cross-correlation 
matrices appearing in the study of multivariate time series 
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of price fluctuations in the stock market [TS], EEG data 
of brain [16] , variation of various atmospheric parameters 
[17], etc. Recent analysis of complex networks under RMT 
framework [18H21] shows that various model networks as 
well as real world networks follow universal GOE statis- 
tics. The analysis of protein-protein interaction network 
of budding yeast reveals that the nearest neighbor spac- 
ing distribution (NNSD) of the spectra of the correspond- 
ing matrix follows RMT prediction |18j . This promising 
result suggests that these networks can be modeled as 
a random matrix chosen from an appropriate ensemble. 
Recently, covariance matrix of amino acid displacement 
[21] and gene co-expression network [22] constructed us- 
ing gene expression profiles from human brains |23j have 
been analyzed under RMT framework. These analyses 
also show that the bulk of eigenvalues of corresponding 
networks follows universal GOE statistics of RMT. The 
universal GOE statistics of eigenvalues fluctuations can 
be understood as some kind of randomness spreading over 
the real and model networks [24]. 

In this letter, we analyze the gene co-expression data 
under the random matrix theory framework. We find that 
the bulk of the spectra follows random matrix predictions 
of the GOE statistics. Rest part of the spectra deviates 
from the universality. We explore the properties of eigen- 
vector from this part of the spectra in detail. Particularly, 
we study the localization behavior of the spectra of the un- 
derlying matrix, and investigate the structural properties 
of the top contributing nodes in the localized eigenvectors. 
We introduce an overlap measure to understand the struc- 
tural properties of the nodes which are picked up based on 
the spectral properties of the underlying network. 

Method and Techniques. 

Construction of gene co-expression network. One key 
problem in constructing gene co-expression network is to 
detect truly co-expressed gene pairs from genomic gene 
expression data [25]. Pearson product method has been 
used traditionally to calculate correlation between pairs 
of genes using different conditions. Pearson correlation 
coefficient (PCC) between two variables is defined as the 
covariance of the two variables divided by the product of 
their standard deviations: 



Px,y 



cov(X,Y) _ E[{X - n x )(Y - ny)} 



(1) 
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The absolute value of PCC is less than or equal to 1. Cor- 
relations equal to 1 or -1 correspond to maximum cor- 
relation and maximum anti-correlation respectively. The 
Pearson correlation coefficient is symmetric: p{X, Y) = 
p(Y,X). Since in the microarrays studies the number 
of samples is often limited, it is very crucial to find the 
most robust method to construct a co-expression network 
with the lowest effect of the sample size. To handle small 
sample size problem, PCC is calculated following a re- 
sampling bootstrap approach |26] . The co-expression net- 
work is constructed, where two genes have weighted link 



if the correlation coefficient for that pair is greater than 
the value 0.5, and the 95% bootstrap confidence interval 
contains only positive numbers [27) . The corresponding 
network has entries one or zero depending upon whether 
the link between i and j is present or absent. The original 
data set [5] has 4021 genes, after applying the threshold 
value th = 0.5, we get the largest connected network with 
N = 4016 nodes. The network is weighted with the dis- 
tribution lying between 0.5 < \wij \ < 1. 

RMT Techniques. In the following we briefly de- 
scribe some of the RMT techniques used in our inves- 
tigation. We denote the eigenvalues of a network by 
Aj, i = 1,...,N, where N is size of the network and 
Ai < A2 < A3 < • • • < Xn- The density distribution 
p(X) follows semi-circular distribution for GOE statistics. 
In order to calculate spacing distribution of eigenvalues, 
one has to remove the spurious effects due to the varia- 
tion of spectral density and to work at constant spectral 
density on the average. Thereby, it is customary in RMT 
to unfold the eigenvalues by a transformation Ai = N(Xi), 
where N(X) = J x p(A') dX' is the averaged integrated 
eigenvalue density |13j . Since analytical form for TV is 
not known, we numerically unfold the spectrum by poly- 
nomial curve fitting. The nearest neighbor spacing dis- 
tribution (P(s)), where Si = Aj+i — Ai, of eigenvalues 

follows P(s) = fsexpf-^) for GOE statistics. The 

distribution of the eigenvectors components provides sys- 
tem dependent information. Let E k is the Ith component 
of fcth eigenvector E k . The eigenvector components of 
a GOE random matrix are Gaussian distributed random 
variables. For this case, the distribution of r = l-E^I 2 , in 
the limit of large matrix dimension, is given by the Porter- 
Thomas distribution |28j . The inverse participation ratio 
(IPR) provides information about the localization proper- 
ties of the eigenvectors The IPR of a eigenvector is 
defined as 

rk (E^lW) 3 
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The meaning of I k is illustrated by the two limiting cases 
: (i) a vector with identical components E k = 1/yN 
has I k = N, whereas (ii) a vector, with one component 
E\ = 1 and the remainders zero, has I = 1. Thus, the 
IPR quantifies the reciprocal of the number of eigenvector 
components that contribute significantly. A vector with 
components following the Porter-Thomas distribution has 
I k ~ N/3. 

Results. — After applying the threshold we get the 
largest connected network with size N = 4016 and E = 
483148 weighted edges. The weights are distributed be- 
tween —0.5 and 0.5. The average degree of this largest 
connected component is calculated as < k >^ 120, and 
the degree distribution is shown to have exponential de- 
cay. The above number of edges yield a densely connected 
network. Higher values of threshold generate sparser net- 
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Fig. 1: (Color online) Left subfigure plots eigenvalue distribu- 
tion for the largest connected cluster for the threshold value 
th = 0.5. Inset plots eigenvalues in the increasing order. Right 
subfigure plots IPR as a function of eigen number i. 

works, with number of nodes in the largest connected clus- 
ter lesser. The value of threshold th = 0.5 is chosen in 
such a manner that it is sufficiently low to get almost 
all nodes into the largest connected cluster, and is suffi- 
ciently high to minimize the noise and measurement ef- 
fect on the network [3011311 ■ The bulk of the eigenvalues 
of this network lies roughly between —16 and 30. Last 
few eigenstates have steep increase in their values, with 
largest eigenvalue \ max = ~ 155 being well separated 
from the bulk. The figure Q] plots eigenvalues distribution, 
which shows triangular shape with exponential decay at 
both the ends. Using the unfolded spectrum, we calculate 
the nearest neighbor spacing distribution P(s), and find 
that it follows GOE statistics of RMT. 

Figure (TjTJ) shows IPR as a double humped well. For a 
network of size N = 4016, the IPR value for the eigenvec- 
tors following RMT prediction would be ~ 1338. Based 
on the eigenvector localization values, the spectra can be 
divided into two parts, one part with the delocalized eigen- 
vectors having value close to the RMT prediction, and an- 
other part which consists the localized eigenvectors. Ac- 
cording to RMT, this indicates that the corresponding net- 
work has a mixture of random connections yielding the de- 
localized eigenvectors of the first part, and the structural 
features corresponding to functional performance leading 
to the localized second part. In order to get insight to the 
system dependent properties, we probe localized eigenvec- 
tors further. Based on the corresponding eigenvalues, the 
localized part of the spectra can further be divided into 
three distinct sub-parts, which we would discuss in detail. 
The first localized part (A) is associated with the lower 
eigenvalues regime, the second localized part (B) corre- 
sponds to the middle part of the spectra near the zero 
eigenvalue, and the third localized part (C) corresponds 
to the eigenstates with larger eigenvalues. 

We make following general observations: the eigenvec- 
tors belonging to the part (A), in general, have the top 
contributing nodes with high degrees. The eigenvectors 
belonging to the part (B) have as few as one or two top 
contributing nodes. Additionally, these top contributing 
nodes have as few as one or two degrees. The eigen- 
vectors belonging to the part (C) have top contributing 
nodes with degree close to the average degree of the net- 
work. The eigenvectors belonging to the part (C) do not 
have distinguished nodes contributing much higher than 
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Fig. 2: (Color online) Typical behavior of the eigenvector ele- 
ments from sets (A), (B) and (C). n denotes the eigenstate, and 
|-E™| 2 denotes the ith component of the nth eigenvector. Nodes 
are reordered such that first top contributing node, where con- 
tribution is measured by (E™) 2 , gets index i being 1, and last 
contributing node gets index i — N. 

the rest. Note that for finite dimensional matrix, devia- 
tion from randomness determines the localization length 
of the eigenvectors [33] . Figure © plots the square of the 
eigenvector components for few eigenvectors lying at the 
bottom of the IPR values. The different nature of eigen- 
vector components in these three parts can clearly be seen. 
The eigenvectors from set (A) show the localization on ap- 
proximately ten to twelve nodes, which contribute to the 
IPR highly. In the reordered nodes, as shown in the Fig- 
ure i[2]). contributions from the top ten- twelve nodes decay 
with the node number, and finally reach to a plateau with 
very small values of (-E™) 2 for rest of the nodes. The eigen- 
vectors in set (B) are highly localized on few nodes, namely 
one or two nodes, rest of the nodes lie towards the bottom 
of the participation measured by (£7™ ) 2 . For the eigenvec- 
tors in set (C), the top contributing nodes give approxi- 
mately same amount of contribution to the participation 
measure. The top 50 localized eigenvectors, except few, 
correspond to the eigenstates with the set (A), i.e. with 
the negative eigenvalues. Note that the eigenstate, corre- 
sponding to the largest eigenvalue (Ajv), has exponential 
decay components, and is not localized to few nodes. 

In the following we analyze the top contributing nodes 
of eigenvectors lying towards the bottom of the IPR table. 
First, we analyze these nodes under the network theory 
framework, i.e. based on their degree and position in the 
network. We use the overlap measure to probe further 
the structural properties of the nodes. Following this, we 
would note some of their functional relevance. 

The top contributing nodes corresponding to the eigen- 
vectors in the set (B), listed in the Ref. [33], are few, and 
interestingly one node appears in most of the localized 
eigenvectors in this set. The first localized eigenvector 
corresponds to the eigenstate n = 2289. It has one local- 
ized node well separated from the others. This localized 
node comes at the number 3884 in the largest connected 
component. The node lies to the periphery of the network 
with only one neighbor connected to it. The neighboring 
node A/3884 € {1185} has a very large degree fcnss = 145. 
The symbol fcj denotes the degree of the zth node, and 
A/i G {ji, ]2----jk} tells that the node i has ki neighbors, 
namely, node ji , node ji an d so on. 

The next localized eigenvector of set (B) corresponds 
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to the eigenstate n = 2357, and is localized on two nodes 
n = 4011 and n = 4016. The node number 4011 is con- 
nected to only one node 3038 with the degree £3038 = 19. 
The neighbors of n = 3038 has varying degrees ranging 
from as low as 1 to as high as 258. The overlapping be- 
tween all the pairs of nodes from the set of A/3038 are l ess 
or equal to the overlapping in the corresponding random 
network. Node number n = 4016 is connected with two 
other nodes A/4016 G {3806,3971} with degrees 30 and 
13 respectively. Furthermore, these neighbors also do not 
have any common neighbors. 

In order to quantify the overlapping between neighbors 
of two nodes i and j, here we define a measure as, 



13 min(NNi,NNj) v ' 

Where NNi is the number of neighbors of node i, and 
NNij indicates the number of common neighbors between 
the nodes i and j. Equation [3] measures the fraction of 
the overlapping neighbors of nodes i and j. The value 
Oij = corresponds to the case when the nodes i and j 
do not share any common neighbor, and the value Oij = 1 
corresponds to the situation when all the neighbors of the 
node i (j) are within the set of neighbors of the node j 

d). 

First, we calculate the overlap between all the pairs 
of nodes i and j, where i is the neighboring node 3038 of 
the first top contributing node n = 4011, and the node j 
being any of the two neighbors 3806, 3971 of the second top 
contributing node 4016. The zero value of Oij tells there 
is no overlapping even between the next to the nearest 
neighbors of the two top contributing nodes from the first 
localized eigenvector of set B. 

The next two localized eigenvectors of the set B, cor- 
responding to eigenstates n — 2258 and n — 2282, also 
have one localized node 3884 well separated from the oth- 
ers. The next two eigenvectors are localized on two nodes. 
Additional to the node number 3884 which is the top con- 
tributing in the previous eigenvectors, there are two other 
nodes, 3008 and 4011, appearing for the eigenvectors cor- 
responding to n = 2298 and n = 2307 states respectively. 

The localized eigenvectors from set (B) show entirely 
different features than the eigenvectors belonging to other 
two sets (A) and (C). The eigenvectors in this set not only 
have as few as one or two localized nodes, but also these 
nodes have few number of distinct neighbors. 

Figure ([3]) shows the section of the network consisting of 
the top contributing nodes, of the localized eigenvectors of 
set (B). The sub-graph shows all the first (light gray solid 
circles, cyan) as well as second (light gray dots, orange, to 
the end of all links) neighbors of these nodes. The node 
number 3008 (dark gray, green, solid circle towards bottom 
left) has three neighbors A/3008 G {293,302,1821} (light 
gray, cyan, solid circles connected with the node number 
3008) with large degrees. The first pair of A/3008 has 30 % 
overlapping neighbors whereas other two pairs have ~ 10% 




Fig. 3: (Color online) Part of the network consisting nodes 
and neighbors of the nodes corresponding to the set (B), see 
text for details. The top contributed nodes of the localized 
eigenvectors of this set are denoted by dark gray solid circles 
(green) . The first neighbors of these nodes are denoted by light 
gray solid circles (cyan), and the second neighbors are denoted 
by the light gray dots (orange). 

and ~ 20% overlapping neighbors. Further more, we cal- 
culate Oij between all the pairs of i and j, where % denotes 
the neighbors of the first contributing node n = 3884, and 
j denotes the neighbors of the second contributing node 
n = 3008. The values of Oij for such pairs are close to the 
value of Oij for the corresponding random network. Simi- 
larly, the two top contributing nodes from the eigenvector 
^(2307) nave n0 next to the nearest common neighbors. 

All the above observations suggest the followings: the 
top contributing nodes in the eigenvector from set (B) are 
either located on the periphery of the network, or they 
serve as a bridge to the several loosely separated commu- 
nities. The top contributing nodes from the same eigen- 
vectors from set (B) belong to the well separated different 
communities. 

The top contributing nodes from set (A), which con- 
sists most of the localized eigenvectors, form a separate 
set for each eigenvector. The top most localized eigenvec- 
tor corresponds to the minimum eigenvalue (n = 1). Ten 
top most contributing nodes for this eigenvector, in se- 
rial, are: 99,1101,64,129,245,1243,1238,17,91,226. All 
of these nodes have very high degrees, and each node sep- 
arately form clique of order 10 with its neighbors. 

In order to measure the overlapping between the neigh- 
bors of the top contributed nodes, we calculate Oij for each 
pair of the nodes appearing in the top ten list of the dif- 
ferent eigenvectors. Figure([4j plots the overlapping mea- 
sure Oij for the four most localized eigenvectors. As can 
be seen from the figure that the values of Oij for the pair 
of neighbors of the nodes from _E( 4016 ) are very high. The 
overlapping for the pair of nodes beyond ten are inconsis- 
tent. Note that the network has the size N = 4016 and 
the average degree < k >~ 120, the number of common 
neighbors for a pair of nodes i and j in a corresponding 
random network would be of the order of Oij ~ 0.03. Fig- 
ure (HJ) shows that the overlapping between neighbors of 
the top ten contributing nodes of eigenvector E^ 4016 ) is an 
order of magnitude larger than that of the corresponding 
random network. The second most localized eigenvector 
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Fig. 4: (Color online) Fraction of overlapping neighbors Oij 
for the pairs of nodes i and j. The horizontal label shows the 
pair index Ej, i and j denoting two different nodes, and the 
vertical axis plots the overlapping between the nodes i and 
j. The overlapping measure for the nodes within the eigen- 
vectors E 1 ,E 2 , E 3 , E 4 and \max are plotted with the different 
gray shades towards the top of the figure, and the overlap- 
ping measure for the pairs of nodes belonging to the different 
eigenvectors are plotted with different gray shades towards the 
bottom of the figure. 



corresponds to the eigenvalue A5. The top ten most con- 
tributing nodes again have degree towards the higher side, 
and they form completely different set than the top con- 
tributing nodes in the previous eigenvector. These nodes 
also form a clique of the order ten. Figure (@| shows that 
the overlapping Oij between all the pairs of the top ten 
contributing nodes are very high. Similarly, the third and 
the fifth most localized eigenvectors also have completely 
different sets of the top ten contributing nodes. Note that 
the fourth most localized eigenvector belongs to the set 
(C). Figure ((4]) plots Oij for all the pairs of top ten nodes 
corresponding to the fourth eigenvector as well. 

Furthermore, Oij between the nodes from the different 
top eigenvectors for the set (A) are very low, Figure 
shows that the overlapping between the different pairs of 
the nodes from the top four eigenvectors are of the or- 
der of the corresponding random network. Note that the 
degree of these nodes are very high, but they form differ- 
ent subgroups corresponding to each eigenvector, without 
significant common nodes even with the neighbors of the 
nodes from the different eigenvectors. 

Furthermore, overlapping between the neighbors of the 
nodes belonging to the different localized eigenvectors 
from set (A) are few. The values of Oij, where j is a 
node of one eigenvector say E^ and j is a node from an- 
other eigenvector of set A, say E 1 , comes in the bottom 
of the figure |U All this suggests that the top contribut- 
ing nodes in the different eigenvector from set (A) form 
densely connected community structure within a eigenvec- 
tor, and which is loosely connected with the nodes from 
the other eigenvectors. 

The first most localized eigenvector in the set (C) corre- 
sponds to the A4014. Though for this set, not much distinct 
localized nodes exist, the top ten contributing nodes have 
their degrees near to the average degree of the network, 
and these nodes are densely connected with each other. 
The very high values of Oy in the Figure ((4]) for A4014 



indicate that, all the pairs of top contributing nodes share 
large number of neighbors. These nodes form entirely dif- 
ferent set than the top ten nodes of the eigenvector cor- 
responding to the maximum eigenvalue \ ma x- The eigen- 
vector corresponding to the maximum eigenvalue comes at 
the number of 44 in the terms of localization. Most of the 
top ten contributing nodes for this eigenvector are same 
as those for the first localized eigenvector which belongs 
to the set (A). Contrary to the top nodes in the other 
eigenvectors of the set (C), the top contributing nodes for 
have de gree much larger than the average degree of 
the network. These nodes form a dense community with 
as high as ~ 80% of overlapping neighbors between each 
pair of the nodes. The second most localized eigenvector 
from this set (C) corresponds to the eigenstate n = 4009. 
The average degree of the top ten contributing nodes is 
close to the average degree of the network, and the nodes 
show very high overlapping between the neighbors. 

Conclusion and Discussion. — We analyzed spectra 
of the gene co-expression network of Zebrafish generated 
for different environmental perturbations. The eigenval- 
ues statistics of the corresponding matrix shows triangu- 
lar distribution with exponential decay at both the ends. 
Triangular distribution is one of the known characteris- 
tics for scalefree networks. The spacing distribution of 
the eigenvalues follows GOE statistics of RMT, which in- 
dicates that there is sufficient amount of randomness |24] 
existing in the co-expression network. The eigenvector 
localization measured by the IPR values shows that the 
bulk of the spectra follows RMT predictions of the GOE 
statistics. The remaining part of the spectra which devi- 
ates from the RMT predictions carries system dependent 
information. The later part of the spectra have localized 
eigenvectors, and is divided into three groups based on 
the associated eigenvalues. Moreover, top contributing 
nodes corresponding to the different groups show different 
structural features. The eigenvectors belonging to the first 
group (A) has top contributing nodes with high degrees, 
and are associated with the lower eigenvalues regime. The 
eigenvectors associated with the second group (B) have top 
contributing nodes with as few as one or two degrees, and 
are associated with the zero eigenvalue regime. The eigen- 
vectors from group (C) lie towards the largest eigenvalue, 
and have top contributing nodes with degrees close to the 
average degree of the network. Most of the top localized 
eigenvectors belong to the part (A) , i.e. correspond to the 
lower eigenvalues regime. According to the RMT, the lo- 
calized eigenvectors distinguish 'genuine correlations from 
apparent correlations' [15| which, in terms of the gene co- 
expression networks, can be interpreted as random corre- 
lations between the genes and functionally important cor- 
relations between the genes. The corresponding matrix is 
not random, and hence leads to the localization of some 
of the the eigenvectors. The top contributing nodes in the 
localized eigenvalues may have important structural and 
functional roles. 
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In order to probe the structural relevance of the top 
contributing nodes we define overlap measure. For the set 
(B), overlapping measure shows that the top contributing 
nodes lie towards the periphery of the network. Addition- 
ally, based on the overlap measure for the neighbors of the 
top contributing nodes from localized eigenvectors, wc sec 
that these nodes belong to the different regions of the net- 
work with a small overlapping between even the next to 
the nearest neighbors. The largest overlap in these pairs is 
those of the corresponding random networks. These obser- 
vations suggest that for the set (B), the top contributing 
nodes belonging to same eigenvectors lie either near to the 
periphery, or belong to the different parts of the networks. 

For the set (A) , the top contributing nodes of a eigen- 
vector have very high values of the overlap measure, infact 
for some of the localized eigenvectors these nodes are part 
of the clique of the order ten. Furthermore, the top con- 
tributing nodes from the different localized eigenvectors 
lie well separated from each other. All the observations 
for set (A) indicate that the top contributing nodes in 
different eigenvectors form separate community structure; 
these communities arc densely connected within, and arc 
loosely connected with each other. 

The top contributing nodes from the set (C) do not show 
any noticeable structural features, except the fact that, 
the nodes from each localized eigenvector form densely 
connected set though they have less number of neighbors, 
equal to the average degree of the network. 

Description in the Ref. [33] suggests that most of the 
genes in the localized sets, which are functionally related 
or are members of pathways leading to similar diseases, are 
basically clustered in one set. The exceptional cases, how- 
ever, may indicate that although these genes were identi- 
fied in the same set, having different function altogether, 
could have similar protein expression patterns that are 
regulated by similar transcriptional cues in similar devel- 
opmental domains, like that of the other members in that 
particular set. Though the biological interpretation drawn 
in the letter is at a preliminary stage, the analysis pre- 
sented here shows the applicability and the usefulness of 
random matrix theory to pick out set of nodes (genes) 
from a large number of interacting nodes (genes), which 
were not detected based on existing structural measures. 
Identification of genes that are significantly responsible for 
diverse toxicological perturbations is important in order 
to develop a future chip, that can be used for detection 
of pollutants and diagnosis of diseases. Future directions 
would involve hand in hand experiments and theoretical 
investigations to make direct relation between cause such 
as environmental perturbations and gene or set of genes 
affected. 
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